Parallelization frameworks have recently become a necessity for speeding up the training of deep neural networks (DNNs). Such frameworks typically employ the Model Average approach, denoted as MA-DNN, in which parallel workers conduct their respective training on their own local data, while the parameters of the local models are periodically communicated and averaged to obtain a global model that serves as the new starting point for the local models. However, since a DNN is a highly non-convex model, averaging parameters cannot guarantee that such a global model performs better than the local models. To tackle this problem, we introduce a new parallel training framework called Ensemble-Compression, denoted as EC-DNN. In this framework, we propose to aggregate the local models by ensemble, i.e., by averaging the outputs of the local models instead of their parameters. As most prevalent loss functions are convex with respect to the output of a DNN, the performance of the ensemble-based global model is guaranteed to be at least as good as the average performance of the local models. However, a major challenge lies in the explosion of model size, since each round of ensembling multiplies the size of the model. Thus, we carry out model compression after each ensemble step, realized in this paper by a distillation-based method, to reduce the size of the global model to that of the local ones. Our experimental results demonstrate the clear advantage of EC-DNN over MA-DNN in terms of both accuracy and speedup.
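The guarantee for the ensemble-based global model follows from Jensen's inequality. As a minimal sketch (the symbols K, f_k, and \ell are our own notation for the number of local models, the k-th local model, and a loss function convex in the model output, and are not taken from the abstract), the loss of the averaged output is bounded by the average loss of the local models:

\[
\ell\!\left(\frac{1}{K}\sum_{k=1}^{K} f_k(x),\; y\right)
\;\le\;
\frac{1}{K}\sum_{k=1}^{K} \ell\!\left(f_k(x),\; y\right).
\]

No analogous bound holds for averaging parameters, since the loss is highly non-convex in the DNN parameters.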